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TO/PCTite'd 19 SEP 2001 

FUNCTIONAL I ZED COMPOUND, OPTIONALLY LABELED POLYNUCLEOTIDE 
AND METHOD FOR DETECTING A TARGET NUCLEIC ACID 

The present invention relates to a novel 
5 nucleoside or nucleotide compound f unctionalized' with an 
alkyl ketone group, to a polynucleotide comprising at 
least one nucleotide unit f unctionalized with an alkyl 
ketone group, before and after labeling, and also to the 
implementation and the uses of these products in 
10 particular for detecting nucleic acid sequences. 

In the nucleic acids field, the synthesis of 
f unctionalized nucleotides has been described in 
particular in the diagnostic field and more particularly 
for preparing labeled nucleic acid probes which may be 
15 used for detecting a target nucleic acid. 

Two main types of problem arise in producing these 
probes. Initially, the f unctionalized nucleotide must be 
incorporated into a polynucleotide. Next, the function 
carried by the nucleotide must be able to react with a 
20 tracer in a way which is specific and effective for using 
the polynucleotide as a detection probe. 

Thus, EP-A-0 407 816 describes uracil /derivatives 
modified at position 5 for producing probes via the 
chemical or enzymatic route. Still with the same aim, 
25 patents WO-A-86/06726 and EP-A-0 212 951 describe cytosine 
derivatives modified at position 4. Patent EP-A-0 254 646 
describes an adenosine derivative modified at position 8. 

Patent WO-A-92/00989 describes a specific use of 
modified nucleotides for introducing proteins onto a 
30 polynucleotide . 

Patent application WO-A- 98/05766 by the applicant 
poses the problem of the incorporation of f unctionalized 
nucleotides which may be incorporated using enzymatic 
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reaction and in particular using enzymatic amplification 
techniques, no longer for preparing labeled probes, but 
directly for generating a labeled target. In this case, 
there is a greater need for sensitivity and the choice of 
5 the f unctionalized nucleotide is essential for producing 
the appropriate sensitivity. 

A certain number of nucleophilic functions, such 
as amine and alkoxyamine functions, or electrophilic 
functions, such as the aldehyde function, are described 

10 which allow effective incorporation of the f unctionalized 
nucleotide in the course of amplification, but there 
nevertheless remains a need for a f unctionalized 
nucleotide which is even more effective, in particular in 
terms of ease of preparation, in terms of neutrality with 

15 respect to the enzymatic or chemical reactions and in 
terms of reactivity for the labeling of this nucleotide 
with a tracer after incorporation. 

It has been found, by the applicant, that, 
surprisingly, the alkyl ketone function answers the 

20 abovementioned drawbacks. 

In fact, while the prior art describes the use of 
R-CO- units in which R is an alkyl group, in the case of 
oligonucleotide synthesis (WO-A-93/22326) , the aim of this 
function is to protect the exocyclic amine of the bases. 

25 At the end of synthesis, the amine is deprotected by the 
action of an alkali agent. 

By way of illustration of this state of the art, 
the following documents may be mentioned. The article by 
K.K. Ogilvie and M.J. Nemer, Tetrahedron Letters, vol. 21, 

30 (1980) pages 4145-4148, discloses, as an intermediate of 
synthesis of a nucleotide, a nucleotide compound carrying 
a levulinoyl group attached to the hydroxyl function on 




the 3' side of the pentose. The aim of introducing this 
group is to protect the OH function, and it is then 
immediately removed after said compound has been obtained. 
The article by C.T.J. Wreesmann et al . , Nucleic Acids 
5 Research, vol. 11, N°23, (1983) pages 8389-8405, describes 
the production of a synthetic dinucleotide intermediate, 
the hydroxyl functions of which are protected by the 
levulinoyl group. In the presence of aqueous ammonia, the 
hydroxyl functions are released. 

10 The alkyl ketone function as defined in the 

present invention is sufficiently stable to withstand this 
type of treatment and does not therefore represent a 
protective group, and its stability with respect to the 
various methods of chemical synthesis results in greater 

15 ease of preparation, in particular with respect to amine, 
aldehyde or alkoxyamine functions, as described in patent 
WO-A-98/05766 . 

Similarly, in the abovementioned patents 
(EP-A-0 407 816, WO-A-8 6 / 0 6 72 6 , EP-A-0 212 951, EP-A-0 254 646 

20 and WO-A-92/00989) , the functions described for 
f unctionalizing the nucleotides are chosen from functions 
which are reactive from a chemical point of view, such as 
amine and thiol functions, which are optionally protected. 
If these functions are protected, a deprotection step is 

25 necessary, which complicates the labeling step. If these 
functions are free, inhibition of the enzymatic reaction 
or of the side reactions in the case of chemical synthesis 
may occur. 

This chemical stability of the alkyl ketone 
30 function should not make it a good candidate for a 
reaction for coupling to a label, and yet the present 
invention demonstrates that, surprisingly, this function 
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is reactive with regard to labeling and that, in addition, 
the detection of the product after labeling is very 
sensitive . 

Finally, the nucleotides carrying this function 
5 exhibit excellent neutrality with respect to enzymatic 
reactions since it is possible to completely replace a 
natural nucleotide with a nucleotide carrying this alkyl 
ketone function in an enzymatic reaction, without 
affecting the yield of this reaction and, even more 

10 surprisingly, in certain cases improving it. 

Document WO-A- 95/24185 describes a nucleoside 
modified with an alkyl ketone group, the alkyl component 
of which may comprise up to 20 carbon atoms. This compound 
is in particular intended for the synthesis of 

15 oligonucleotides which find an application in therapy, in 
diagnosis. The introduction of a group, in particular an 
alkyl ketone group, onto the pyrimidine nucleus of the 
nucleoside described is not directed toward 
f unctionalization in view of a subsequent reaction of said 

20 group, but is aimed at producing oligonucleotide analogs 
which, with respect to the natural oligonucleotides, 
exhibit properties which are advantageous in terms of 
their uses, such as greater ease of hybridization with a 
target nucleic acid or greater resistance to nucleases. 

25 The object of the present invention is to describe 

a novel f unctionalized compound which comprises an alkyl 
ketone function and which has the formula (I) below: 

O 
II 

W — L CU 2 CR X (I) 

30 



in which 
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- W represents a nucleotide analog; 

- L represents a linker arm between W and the 
alkyl ketone group, comprising at least four atoms, 
advantageously at least eight atoms; L is in particular 

5 chosen from saturated or unsaturated hydrocarbon-based 
chains, optionally interrupted by at least one amine, 
amide or oxy function; preferably, the linker arm 
comprises a chain of 8 to 30 atoms; in particular, the 
linker arm comprises from 8 to 20 atoms, at least one of 
10 which is an amide function; 

- R x represents a linear or branched alkyl chain, 
preferably an alkyl chain having at most 6 carbons; 
advantageously, R x is a methyl group. 

The term "nucleotide analog" is intended to mean a 
15 nucleoside or a nucleotide, a nucleoside or a nucleotide 
carrying one or more modifications on one of the 
constituent elements of said nucleoside or nucleotide, for 
instance a modification of the deoxyribose or ribose 
sugar, in particular xylose, arabinose, sugars with an 
20 alpha configuration (FR 2 607 507) , PNAs (M. Egholm et 
al . , J. Am. Chem. Soc . , (1992), 114, 1895-1897), sugar 
analogs such as 4 ' - thio-ribose or -deoxyribose, or sugars 
with a D or L configuration;- a modification of the 
nitrogen-containing base; a modification of the phosphate 
25 or of its equivalent for the nucleotides and also all of 
the protective groups used in the chemical synthesis. 

Advantageously, the compound corresponds to the 
formula (I) in which W has the general formula (II) : 
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in which: 

- R 2 represents H or a protective group; 

5 - R 3 represents H, F, OH, SH, NH 2/ OCH 3 or OR 5 in 

which R 5 represents a protective group or an alkyl chain; 

- R 4 represents an H radical, a protective group 
or a mono-, di- or triphosphate group; 

- B represents a nitrogen-containing base, 
10 - W being attached to L via B. 

The nitrogen-containing base is chosen in 
particular from purines or pyrimidines, such as adenine, 
guanine, uracil, cytosine or thymine, or any other 
modified base allowing hybridization, for instance natural 
15 modified bases (such as 6-keto purine, xanthine, 5- 
methylcytosine or 2 -aminopurine) or unnatural modified 
bases (such as thioguanine or 8 -oxoguanine , deazapurine or 
azapurine) , or analogs of bases such as universal bases 
(such as nebularin, nitroindole or nitropyrrole 
20 derivatives) . Certain functions of the bases likely to 
interfere with the strategies of solid- or liquid-phase 
chemical synthesis may be protected with suitable 
protective groups . 

Preferably, the nitrogen-containing base is 
25 adenine, uracil or cytosine. 

The linker arm L is grafted onto any position of 
the nitrogen-containing base or of its analog. Preferably, 
the linker arm will be grafted onto a position which does 
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not disturb hybridization. In particular, the linker arm 
will be attached to the amine at position 4 of cytosine, 
position 5 of uracil or the .amine at position 6 of 
adenine . 

5 The term "protective group" is intended to mean 

the groups conventionally used in the chemical synthesis 
of nucleosides, nucleotides and oligonucleotides (see, for 
example, Chemistry of Nucleosides and Nucleotides, Edited 
by Leroy B. Townsend, Plenum Press, New York and London 

10 and Protocols for Oligonucleotides and Analogs, Synthesis 
and Properties, Edited by S. Agrawal, Humana Press, 
Totowa, New Jersey) . Preferably, in the case of chemical 
synthesis, R 4 is a 4 , 4 ' -dimethoxytrityl group and R 2 is a 
2 -cyanoethyl -N, N-diisopropylphosphoramidite group and R 3 is 

15 H or OR 5/ in which R s is a protective group used in 
oligoribonucleotide synthesis . 

Preferably, in the case of enzymatic synthesis, R 4 
is a triphosphate group, R 2 is H and R 3 is OH. 

The phosphate groups are generally in the form of 

20 salts, and particularly lithium, sodium or 

triethylammonium acetate salts. 

The invention also relates to a functional! zed 
polynucleotide comprising at least one f unctionali zed 
nucleotide as defined above. It may be synthesized by 

25 chemical and/or enzymatic reaction. In the case of 
synthesis by enzymatic reaction, and in particular in the 
case of enzymatic amplification, the neutrality of the 
f unct ionalized nucleotide with respect to enzymatic 
reactions allows the incorporation of several 

30 f unct ionalized nucleotides. 

The term "enzymatic reaction" includes all 
reactions in which at least one enzyme with activity which 
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is related to a nucleotide is involved. It is thus 
intended to mean all reactions comprising at least one 
enzymatic step in which a nucleotide is used as the 
substrate for the enzyme, regardless of whether said 
5 nucleotide is transformed during this enzymatic step. By 
way of example, such reactions are chosen from those used 
in molecular biology techniques such as transcription, 
ligation, elongation and cleavage and more particularly in 
amplification techniques (see, for example, the article by 
10 E. Winn-Deen, Journal of Clinical assay, vol 19, p21-26, 
(1996) ) . 

Thus, the enzymes which have activities which are 
related to nucleotides may, in particular, be selected 
from the following nonexhaust ive list: DNA-dependent DNA 

15 polymerases, such as the DNA polymerase I Klenow fragment 
of E . Coli, TAQ polymerase, the T7 , T4 or T5 DNA 
polymerases, eukaryotic cellular polymerases or viral 
polymerases; RNA-dependent DNA polymerases, such as AMV 
(avian, myoblastosis virus) or MMLV (Moloney murine 

20 leukemia virus) polymerases; RNA polymerases, such as the 
T7, T3 SPG , N4, or PBSII RNA polymerases, or E. Coli RNA 
polymerase; enzymes with nuclease activity, such as 
restriction endonucleases or Rnase H; or polyA 
polymerases, replicases such as Q-beta-replicase , terminal 

25 transferases or ligases. 

Heat-stable enzymes which have the enzymatic 
activities described above may also be used in the 
invention . 

According to a preferred embodiment, techniques 
30 using a transcription step, such as NASBA (nucleic acid 
sequence based amplification) , TMA (transcription mediated 
amplification) or post-PCR (polymerase chain reaction) 
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transcription, as described in the articles by 
R.J. Lipshutz et al, Biotechniques , 19(3), p442-447, 1995 
or M. Kozal et al , Nature Medecine , 2(7), p753-759, 1996, 
will be chosen for the synthesis of the f unctionalized 
5 polynucleotide. 

The elements and conditions required for carrying 
out these enzymatic reactions so as to produce a 
polynucleotide are well known to those skilled in the art. 
The manual Current Protocols in Molecular Biology, edited 

10 by F.M. Ausubel, R. .Brent, R.E. Kingston, D.D. Moore J.G. 
Seidman, J. A. Smith and K. Struhl , John Wiley & Sons, 1996 
volume 1 chapter 3, gives the methods for enzymatic 
manipulation of DNA and RNA. Similarly, the work 
"Molecular Methods for Virus Detection" , edited by 

15 D.L. Wiedbrand and D.H. Farkas, Academic press, San Diego, 
1995 in particular in chapters 8, 9, 12, 13, 14, 15 and 
16, gives examples for the enzymatic amplification 
techniques . 

The term "chemical synthesis" is intended to mean 
20 all methods, both in solid phase and in liquid phase, in 
which a suitably protected nucleotide monomer reacts with 
another nucleotide monomer or polymer via a coupling 
reaction. 

Chemical synthesis methods are given, for example, 
25 in "Methods in Molecular Biology, volume 20, Protocols for 
oligonucleotides and analogs", edited by S. Agrawal, 
Humana Press, Totowa, New Jersey, 1993 and "Methods in 
Molecular Biology, volume 26, Protocols for 

oligonucleotides conjugates", edited by S. Agrawal, Humana 
30 Press, Totowa, New Jersey, 1994. 

The term "polynucleotide" is intended to mean a 
chain of at least 2 nucleotide monomers. Preferably, if 
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the polynucleotide is synthesized by the chemical route, 
it is less than 3 00 nucleotides, and advantageously less 
than 150 nucleotides, in size. Preferably, if the 
polynucleotide is synthesized by enzymatic reaction, it is 
5 less than 2 0 kb, and advantageously less than 10 kb, in 
size . 

The two routes of synthesis, chemical synthesis 
and enzymatic synthesis, can be combined in order to 
prepare a polynucleotide. 
10 The invention also relates to a labeled 

f unctionalized polynucleotide comprising at least one 
f unctionalized compound of general formula (I') : 

W- [-L-] n-CHsCOR! . (i-) 

15 

in which 

W represents a nucleotide analog as defined above, 
L represents a linker arm comprising at least four 

atoms , 

20 n represents an index equal to 0 or 1, 

R L represents a linear or branched alkyl chain, 
the alkyl ketone group of said f unctionalized 
compound having interacted with a labeling reagent. 

W, L and advantageously satisfy the definitions 
25 given above to describe preferred functional! zed compounds 
of the invention. 

The term "labeling reagent" is intended to mean a 
tracer which directly or indirectly generates a detectable 
signal capable of reacting with the alkyl ketone function. 
30 The following is a nonlimiting list of these 

tracers : 




enzymes which produce a signal detectable, for 
example, by colorimetry, fluorescence or luminescence, 
such as horseradish peroxidase, alkaline phosphatase, 
beta-galactosidase or glucose- 6 -phosphate dehydrogenase ; 
5 - chromophores, such as fluorescent, luminescent 

or dye compounds ; 

groups with an electron density which can be 
detected by electron microscopy or through their 
electrical property, such as by conductivity, amperometry, 
10 voltametry, impedance measurements; 

groups which can be detected using optical 
methods, such as diffraction, surface plasmon resonance or 
contact angle variation, or physical methods, such as 
atomic force spectroscopy, the tunnel effect; 
15 - radioactive molecules such as 32 P, 35 S or 125 I. 

Preferably, the tracer is a fluorescent compound 
with low steric hindrance, such as fluorescein, dansyl , 
chromophores of the IR (Li -COR Inc, Lincoln NE, USA) , CY5 
and CY3 (Randolph J.B. and al , Nucleic Acids Res., 25(14), 
20 p2 923 -2 92 9, 1997) type and derivatives thereof. The term 
"low steric hindrance" is intended to mean a molecular 
weight of less than 1000 g/mol . 

In order to react with the alkyl ketone function, 
this labeling reagent must carry a nucleophilic function 
25 capable of reacting with an alkyl ketone function, such as 
alkoxyamine or hydrazine functions. 

Preferably, the function chosen is alkoxyamine, 
which may be introduced by any direct or indirect means. 
The term "direct means" is intended to mean a covalent 
30 bond between the tracer, or a molecule carrying the 
tracer, and the alkoxyamine function. The term "indirect 
means" is intended to mean complexation systems of the 
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metal/chelate type or affinity systems, i.e. haptens which 
can be detected by a specific antibody or a protein, such 
as the biotin/avidin or streptavidin pairing or the 
sugar/lectin pairing. In this case, the tracer is carried 
5 by the antibody or the protein and the alkoxyamine 
function is carried by the hapten, 

In particular, the labeling reagent has the 

formula : 




HNCNHCH 2 CH 2 CH 2 NHCCH 2 ONH 2 
II II 

s o 

10 



The invention also relates to a solid support to 
which a nucleotide, a nucleoside or a polynucleotide 
according to the invention is attached by covalence. 

15 in order to perform this attachment, a nucleotide, 

a nucleoside or a polynucleotide comprising an alkyl 
ketone group is reacted with a solid support on which 
there is an alkoxyamine or hydrazine function, preferably 
an alkoxyamine function. 

20 In a first embodiment, the polynucleotide is 

preformed and the final reaction consists in grafting, at 
a predetermined position on the support, the 
polynucleotide. In one particular embodiment, the 
polynucleotides are synthetic oligonucleotides 

25 (synthesized via the chemical route) which are short in 
size (less than 50 bases) , and in a second particular 
embodiment, the polynucleotides are more than 50 bases in 




size and are prepared using enzymatic methods such as 
enzymatic amplification . 

In a second embodiment, the nucleoside or 
nucleotide is added via successive steps (chain extension) 
5 on the support in order to obtain, at the end of the 
synthetic cycle, a polynucleotide grafted to a 
predetermined position on the solid support. 

A preferential use of these grafted supports is 
the production of biochips for gene analysis. 

10 By way of illustration, examples of these biochips 

are given in the 1 publications by G. Ramsay, Nature 
Biotechnology, 16, p40-44, 1998; F. Ginot, Human Mutation, 
10, pl-10, 1997; J. Cheng et al, Molecular diagnosis, 
1(3), pl83-200, 1996; T. Livache et al , Nucleic Acids 

15 Research, 22(15), p2915-2921, 1994; J. Cheng et al , Nature 
Biotechnology, 16, p541-546, 1998 or in patents 
US-A-4 981 783 (Augenlicht) , US-A-5 700 637 (Southern) , 
US-A-5 445 934 (Fodor) , US-A-5 744 305 (Fodor) , 

US-A-5 807 522 (Brown) . 

20 The invention also relates to a method for 

detecting a target nucleic acid in a sample, in which this 
target nucleic acid, optionally pretreated, is brought 
into contact with at least one f unct ionalized compound 
corresponding to formula (I'), in the presence of the 

25 elements and under conditions required for producing a 
polynucleotide of the invention, so as to produce a 
f unct ionalized polynucleotide; in labeling said 
polynucleotide with a labeling reagent and then in 
detecting said labeled polynucleotide. The abovementioned 

30 elements and conditions are well known to those skilled in 
the art . 

The term "pretreatment " is intended to mean the 
various steps for treating the sample in order to make the 
target nucleic acid accessible, such as for example 
35 lyzing, fluidifying, concentrating. 
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Preferably, the f unctionalized polynucleotide is 
obtained via an enzymatic amplification reaction which 
acts on the target nucleic acid which serves as a matrix, 
and which is capable of incorporating the f unctionalized 
5 nucleotide. 

Advantageously, the enzymatic amplification 
technique is NASBA (nucleic acid sequence based 
amplification) , TMA (transcription mediated amplification) 
or post-PCR (polymerase chain reaction) transcription, as 

10 described in the articles by R.J, Lipshutz et al, 
Biotechniques, 19(3), p442-447, 1995 or M. Kozal et al , 
Nature Medecine , 2(7), p753-759, 1996. 

The labeled polynucleotide may be detected 
qualitatively and/or quantitatively in homogeneous or 

15 heterogeneous phase. A preferential detection mode 
consists in attaching the labeled polynucleotide to a 
solid support via a hybridization reaction between the 
labeled polynucleotide and another polynucleotide, itself 
attached to the solid support, and then revealing the 

20 presence of the labeled polynucleotide after a washing 
step. This revelation is carried out directly by reading, 
such as for example with a scanner or a camera if the 
tracer is a fluorescent molecule. 

The detection method is particularly useful when a 

25 multitude of polynucleotides are attached to the solid 
support at a predetermined position so as to form a " DNA 
chip" . 

Specifically, the density of the polynucleotides 
attached to the solid support imposes considerable steric 
30 constraints during the hybridization and the labeled 
polynucleotide according to the invention allows good 
sensitivity of detection. Examples of these chips are 
given, for example cited in the publications and patents 
mentioned above . 




The detection method can be used for sequencing, 
profiling the expression of messenger RNAs or screening 
mutations, or diagnosing infectious or genetic diseases. 

A fragmentation step can take place in order to 
5 promote the hybridization of the labeled polynucleotide on 
the DNA chip, before, together with, or after the labeling 
step . 

The invention also relates to a method for 
detecting a target nucleic acid in a sample, in which this 

10 target nucleic acid is brought into contact with a 
f unctionalized polynucleotide of the invention, the 
labeling reagent is reacted and the presence of the target 
nucleic acid is detected. 

Another means of carrying out the invention is to 

15 react the f unctionalized polynucleotide and the labeling 
reagent, before the hybridization with the target nucleic 
acid. This target nucleic acid may have been amplified 
using an enzymatic amplification technique. 

Finally the invention relates to the method for 

20 detecting a target nucleic acid, according to which a 
labeled polynucleotide of the invention is available for 
use, the nucleic acid is brought into contact with the 
labeled polynucleotide and the presence of the target 
nucleic acid is detected. 

25 The term "solid support" as used herein includes 

all materials on which it is possible to immobilize a 
polynucleotide for use in diagnostic tests and in 
separation processes. Natural or synthetic materials, 
which may or may not be chemically modified, may be used 

30 as a solid support, in particular polysaccharides such as 
cellulose-based materials, for example paper, cellulose 
derivatives such as cellulose acetate and nitrocellulose, 
dextran; polymers such as polyvinyl chlorides, 
polyethylenes , polystyrenes, polyacrylates , polyamides, or 

35 copolymers based on monomers of the styrene type, 
unsaturated carboxylic acid esters, vinylidene chloride, 
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dienes or compounds with nitrile functions (such as 
acrylonitrile) ; vinyl chloride/propylene copolymers, vinyl 
chloride/vinyl acetate copolymers; natural fibers such as 
cotton and synthetic fibers such as nylon; inorganic 
5 materials such as silica, quartz, glasses, ceramics; 
latexes, i.e. colloidal aqueous dispersions of any water- 
insoluble polymer; magnetic particles; metallic 
derivatives, gels etc. 

The following examples make it possible to 
10 illustrate some of the advantages of the invention 
without, however, limiting the scope thereof. 

The attached figure represents the transcription 
yields measured as a function of the nucleotide used. 

15 EXAMPLE I : SYNTHESIS OF METHYL KETONE NUCLEOTIDES 

1.1. Synthesis of uridine (C5 ) -C9 -methylketone 1 




OH OH 
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1 

Synthetic pathway: 





OH OH 



The 2 1 , 3 ' -OH positions are protected by reacting 
acetone in acid medium according to the protocol described 
in "Nucleic acid chemistry, Editors Townsend-Tipson, 
5 Wiley-Interscience, John Wiley & Sons, p 765-766 (1978)". 



Synthesis of the methyl ketone chain 

The methyl ketone unit is introduced onto the arm 
by peptide coupling between propargyl amine and 6-oxo- 
10 heptanoic acid. 




IBuOCOCl 

N-methylmorpholine 
THF 
Yield: 80% 




15 6-Oxoheptanoic acid (2.75 g, 18.16 mmol) is 

solubilized in 4 0 ml of anhydrous THF. The solution is 
placed at 0°C under an argon atmosphere. N-methyl- 
morpholine (2 ml, 18.16 mmol) is then added, followed by 
propargylamine (1.25 ml, 18.16 mmol) 15 min later, and the 



cii 9i s feja?" 91 a „ o gt jl *9 o ■ 



18 



mixture is allowed to return to room temperature. After 
reaction for 30 minutes, the precipitate is filtered and 
evaporated to dryness. 

The residue obtained is taken up in 
5 dichloromethane and is washed with a 0.1 N aqueous sodium 
hydroxide solution, then with a 1 N aqueous hydrochloric 
acid solution and finally with water saturated with sodium 
chloride . 

After drying over Na 2 S0 4/ the dichloromethane is 
10 evaporated off and the residue is chromatographed on 
silica gel (eluent: ethyl acetate). The product is thus 
obtained in the form of a white powder (2.6 g, 14.4 mmol; 
80%) . 

The methyl ketone chain was characterized by 
15 proton NMR, carbon 13 NMR and mass spectrometry. 

Introduction of the methyl ketone chain onto the 
nucleoside: Heck coupling (Hobbs, J. Org. Chem. , 1989, 54 , 
3420-3422) 

20 




25 5-Iodouridine 2 1 , 3 ' - isopropylidene (500 mg; 

1.22 mmol) and copper iodide (44 mg, 0.2 32 mmol) are added 
to 5 ml of DMF which has been degassed and placed under 
argon. The reaction is placed in darkness and then 
triethylamine (323 jxl , 2.32 mmol) and the chain containing 

30 the methyl ketone function (63 0 mg, 3.4 8 mmol) are added. 
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The mixture is left under argon for 10 minutes. 
Tetrakis triphenylphosphine palladium (134 mg, 0.116 mmol) 
is then added. After reaction for 3 hours, the DMF is 
evaporated off, followed by coevaporat ion with 
5 acetonitrile . The residue is taken up in ethyl acetate and 
the organic phase is washed with an aqueous solution 
saturated with sodium chloride. After drying over Na 2 S0 4 
and evaporation, the residue is chromatographed on silica 
gel (eluent: ethyl acetate/methanol : 90/10). 

10 After evaporation, the methyl ketone nucleoside is 

obtained in the form of a whitish powder (340 mg, 
0.73 mmol, 60%). The product was characterized by proton 
NMR, carbon 13 NMR and mass spectrometry. The methyl 
ketone nucleoside correctly protected for the introduction 

15 of the triphosphate in 5' is thus obtained. 

Production of the uridine ( C5 ) -C9 -me thylke tone 
triphosphate nucleoside 1_ 

Phosphorylation: Eckstein, J. Org. Chem. , 1989, 

20 54, 631-63 5 




25 The methyl ketone nucleoside (46 mg, 0.1 mmol) is 

dissolved in anhydrous pyridine and evaporated twice. 
100 ml of pyridine, 300 ml of dioxane and a freshly 
prepared solution of 2 -chloro-4H- 1 , 2 , 3 -dioxaphosphorin-4 - 
one (1 M) in dioxane (13 0 pi; 13 0 /imol) are then added 

30 under argon, the mixture is left to stir for 20 minutes, 
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and then a 0.5 M solution of tributyl ammonium 
pyrophosphate in anhydrous DMF (320 fil , 0.16 mmol) and, 
simultaneously, 130 fil of tributylamine are then added. 
After 3 0 minutes, 2 ml of 1% iodine solution are added to 
5 a pyridine/water (98/2: v/v) mixture. 

After stirring- for 20 minutes, the excess iodine 
is eliminated with a 5% aqueous NaHS03 solution and the 
stirring is continued for 10 minutes. The mixture is 
evaporated to dryness and an extraction is carried out 

10 with a water/dichloromethane mixture. The aqueous phase is 
evaporated off and then CI 8 reverse phase chromatography 
(flash) is performed (eluent: water/methanol l/l) . The 
fractions containing the product are evaporated and the 
counter- ion exchange is carried out by passing over Dowex, 

15 Na+ resin. The protected triphosphate is thus recovered 
(0 . 03 mmol ; 30%) . 

Deprotection 

20 




1 



25 The protected triphosphate (0.03 mmol) is taken up 

in 15 ml of milli-Q water to which 15 ml of a 25% aqueous 
TFA solution are added. The solution is stirred for 15 
minutes, then evaporated and coevaporated twice with 
water . 
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The product is taken up in 10 ml of milli-Q water 
and neutralized with 0.1 N sodium hydroxide, to pH 8. 
After the evaporation, the protected triphosphate is 
purified on C18 reverse phase (H 2 0 then H 2 0/MeOH, 1/1) . The 
5 counter- ion exchange is carried out by passing over cation 
exchange resin (Dowex Na + ) . The fractions containing the 
product are evaporated and assayed. 0.021 mmol (70%) of 
the uridine (C5) -C9-methylketone triphosphate nucleoside 1. 
is thus recovered, and this nucleoside was characterized 
10 by proton NMR, carbon 13 NMR and phosphorus 31 NMR. 
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1.2. Synthesis of adenosine (N6) -CIO -me thylketone 2 



9 9 
-o- p— o- p— o- 

o- o 




OH OH 



Synthetic pathway 

The introduction of diaminobutane at position 6 of 
the adenosine is carried out - by substitution of the 
triazolo group carried by the protected intermediate 

10 nucleoside. The methyl ketone function is then introduced 
by peptide coupling at the nucleoside level. After 
deprotection of the silyl group in 5', phosphorylation is 
carried out by the Eckstein method. After deprotection, 
the expected triphosphate is obtained, which is 

15 characterized by proton NMR and phosphorus 31 NMR . 



2/ TBDMS-CI 
pyridine 



TBDMS-O 



65% 



1/ H 2 N- 



21 Hi 




50 °C 



O 

iBuOCOCI, 
N-methylmorpholine 

3/TBAF 



PPP-O^-^O^J 




V 'n 

Yield: 36% 



OH OH 



1/ Eckstein 
2/ aq. 25% TFA \^ |7 



Yield: 70% 
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Synthesis of the protected triazolo nucleoside 

Preparation of adenosine 2 ' , 3 f - isopropvlidene 
(Nucleic Acid Chemistry, part 2, Editors Townsend, 
Tipson, Wiley Interscience, John Wiley & Sons p. 768, 
5 (1978) ) 

Ethyl orthoformate (12.44 ml, 74.8 mmol) is added 
dropwise, under argon, to a suspension of adenosine (5 g, 
18.7 mmol) in acetone (10 ml) containing APTS (para- 
toluene-sulfonic acid) (3.9 g, 20.6 mmol). After reaction 

10 overnight, 110 ml of water containing 1.86 ml of aqueous 
ammonia at 2 7% are added. After stirring for 3 0 minutes, 
the reaction mixture is evaporated until white crystals 
appear. After 12 h at +4°C, a white precipitate is 
obtained which is recrystallized in water. 4.17 g 

15 (13.5 mmol, 72%) of product is obtained in the form of a 
white powder. This intermediate was characterized by 
proton NMR. 

Synthesis of amidine (Bartlett and Humphreg, J". 
20 Chew. Soc, 1967, 1664-1666) 



OHC-NH-NH-CHO + SOCi 2 




, 2HCI 



Thionyl chloride (39.96 g, 24.5 ml, 0.338 mol) is 
25 added dropwise to N-N' -diformyl hydrazine (12 g, 0.136 mol) 

in DMF (270 ml) at 10°C. The mixture becomes yellow. 

Stirring is maintained for 2 days. The precipitate 

obtained is filtered and washed with DMF then with ether. 

After drying under a vacuum, the amidine is obtained with 
30 a yield of 95% (28 g, 0.130 mol) . 




Preparation of the triazolo intermediate (Samano, 
Miles, Robins, J. Am. Chem. Soc . , 1994, 116 , 9331-9332) 

The adenosine isopropylidene (1 g, 3.2 mmol) and 
the amidine described above (1.4 g, 6.5 mmol) are stirred 
5 into pyridine (15 ml) at 100 °C under argon for 4 8 h. The 
pyridine is then evaporated off and coevaporated with 
toluene. The oil obtained is then taken up with ethyl 
acetate and this organic phase is washed with water 
saturated with NaCl . After drying over Na 2 S0 4 and 
10 evaporation, the triazolo nucleoside is obtained in the 
form of a white powder with a yield of 60% (700 mg, 
1.9 mmol) . It was then characterized by proton NMR. 

Protection in 5' of the triazolo derivative 
The triazolo nucleoside (1 g, 2.8 mmol) is 
solubilized in 20 ml of pyridine. TBDMS - Cl (tert- 
butyldimethylsilyl chloride) , (462 mg, 3 mmol) is added at 
0°C under argon. Stirring is maintained for two hours then 
the pyridine is evaporated off. The residue thus obtained 
is chromatographed on silica gel (eluent : CH 2 Cl 2 /methanol : 
95/5) . After evaporation, the totally protected 
intermediate adenosine is obtained in the form of a white 
powder (1.25 mg, 2.6 mmol, 93%). This nucleoside was 
characterized by proton NMR, carbon 13 NMR and mass 
spe c t r ome t ry . 

Introduction of the diaminobutane chain onto the 
protected intermediate adenosine 

The triazolo adenosine (1.25 mg, 2.64 mmol) is 
30 solubilized in 10 ml of acetonitrile . Diaminobutane 
(2.7 ml, 2 5.4 mmol) is then added and the mixture is 
stirred at 50 °C under argon. After 5 hours, the solvent is 
evaporated off and the residue is taken up in ethyl 
acetate. This organic phase is washed with water saturated 
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with NaCl . After drying over Na 2 S0 4 and evaporation by 
chromatography on silica gel (eluent : CH 2 Cl 2 /MeOH: 8/2, 
then CH 2 Cl 2 /MeOH: 8/2 in the presence of 2% aqueous 
ammonia) . 

5 After evaporation, the product is obtained in the 

form of an oil (1 g, 2 mmol , 8 0%) . 

The aminated nucleoside was characterized by 
proton NMR, carbon 13 NMR and mas.s spectrometry. 

10 Coupling to the methyl ketone chain and 

deprotection in 5' 

6-Oxo-heptanoic acid (288 mg, 2 mmol) is dissolved 
in 5 ml of anhydrous THF . The solution is placed at 0°C 
under argon. N-Methylmorpholine (223 fil , 2 mmol) is then 

15 added, followed by isobutyl dichlorof ormiate (258-/zl, 
2 mmol) 5 min later. After 15 minutes, the aminated 
nucleoside is added. After 2 hours, the THF is evaporated 
off and the residue is taken up with ether. This is washed 
with a IN aqueous NaOH solution and then with water 

20 saturated with NaCl . After drying over Na 2 S0 4 and 
evaporation, an oil is obtained. The desilylation is 
carried out by taking the oil up in 10 ml of THF, to which 
TBAF (3.25 ml of a 1M solution in THF) 'is added. After one 
hour, the solvent is evaporated off. The residue is 

25 solubilized in dichloromethane and washing is carried out 
with water saturated with NaCl. After drying and 
evaporation, the residue is chromatographed on silica gel 
(eluent: CH 2 C1 2 , then CH 2 Cl 2 /MeOH: 95/5). 

After evaporation, 870 mg of the nucleoside 

30 carrying the methyl ketone function and deprotected in 5' 
(1.7 mmol, 85% over 2 steps) is obtained. It is carried by 
proton NMR, carbon 13 NMR and mass spectrometry. 




Phosphorylation and production of the adenosine- 
(N6 ) -CIO -m&thyl ketone nucleotide 2 

The adenosine- (N6) -ClO-methyl ketone protected in 
2', 3' by isopropylidene is dissolved in anhydrous 
5 pyridine and is evaporated twice. 500 /xl of pyridine, 
1 . 5 ml of dioxane and a freshly prepared solution of 2- 
chloro-4H-l , 2 , 3 -dioxaphosphain-4 -one (1 M) in dioxane 
(650 fxl , 0.65 mmol) are then added under argon. Stirring 
is allowed to proceed for 20 minutes and then a 0.5 M 

10 solution of tributylammonium pyrophosphate in anhydrous 
DMF (1;6 ml, 0.8 mmol) is added simultaneously with 650 fil 
of tributylamine . 

After 30 minutes, 10 ml of 1% iodine solution is 
added to a pyridine/water (98/2, v/v) mixture. After 20 

15 minutes, the excess iodine is destroyed with a 5% aqueous 
NaHS0 3 solution and stirring is maintained for 10 minutes, 
the mixture is evaporated to dryness and 
water/dichloromethane extraction is performed. The aqueous 
phase is evaporated off and then reverse phase (C18) 

20 purification is carried out (eluent : H 2 0/MeOH) . The 
fractions containing the product are evaporated and the 
counter- ion exchange is carried out by passing over Dowex, 
Na+ resin. The protected triphosphate (0.28 mmol, 56%) is 
thus recovered. 

25 

Devrotection 

0.035 mmol of protected triphosphate are taken up 
in 17.5 ml of milli-Q water, to which 17.5 ml of a 25% 
aqueous TFA solution are added. The mixture is stirred for 
30 15 minutes and then evaporated and coevaporated twice with 
water, and the product is taken up in 10 ml of milli-Q 
water and neutralized with 0.1 M sodium hydroxide, to 
pH 8. After evaporation, a C18 purification is carried out 
(eluent: H 2 0; H 2 0/MeOH) . The fractions containing the 




product are evaporated and assayed. 0.022 mmol (63%) of 
the adenosine- (N6) -CIO -methyl ketone triphosphate 2 are 
thus obtained. 

5 T.3. Synthesis of cytidine- (N4 ) -CIO -methyl ketone 

3 




3 

10 This nucleotide was prepared in two ways: 

- Nucleotide pathway: coupling between an 
activated ester of the methyl ketone chain and aminated 
cytidine triphosphate . 



ppp 

15 




- Nucleoside pathway: synthesis of the methyl 
ketone nucleoside then phosphorylation by the Eckstein 
method . 

20 
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EXAMPLE II : SYNTHESIS OF THE OX Y AMINE FLUOROPHORE 



XX. 1. Synthetic schemes 

The oxyamine chain is introduced onto the 
fluorescein in three steps: the first step is nucleophilic 
10 addition of 1 , 3 -diaminopropane to fluorescein 

isothiocyanate (FITC) . After reverse phase purification, 
the oxyamine unit protected in the form of Fmoc is 
introduced by peptide coupling. The free oxyamine is 
generated by deprotection in basic medium. 

15 





5 Introduction of the 1 , 3 -diaminoproipane chain: 

Diaminopropane (585 /xl, 6.96 mmol) is added to 
2 0 ml of anhydrous DMF . Fluorescein isothiocyanate (FITC) 
(500 mg, 1.16 mmol), solubilized in 7 ml of anhydrous DMF, 
is then added, dropwise, under argon. Stirring is 
10 maintained for 15 minutes after the FITC has been added. 
The mixture is evaporated to dryness and coevaporated 
twice with water. The residue is reverse phase (C18) 
chromatographed: (eluent: H 2 0/MeOH: l/l). 

The product is thus recovered in the form of an 
15 orange powder (420 mg, 0.93 mmol, 80%). It is 
characterized by proton NMR, carbon 13 NMR and mass 
spectrometry . 



Introduction of the protected alkoxvamine group: 



20 Carboxyalkoxyamine protected with Fmoc : HOOC-CH 2 - 

ONH-Fmoc (473 mg, 1.5 mmol in 10 ml of anhydrous DMF) is 
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solubilized. It is placed at 0°C under argon. N- 
Methylmorpholine (166 /il, 1.5 mmol) is then added and, 
after 15 minutes, the fluorescein carrying the 
diaminopropane chain (3 50 mg, 0.7 5 mmol) is added. After 
5 reacting for 1 hour, the DMF is evaporated off to dryness. 
The residue obtained is chromatographed on silica gel 
(CH 2 Cl 2 /MeOH: 85/15 (solid deposit)). The protected 
f luorescein-alkoxyamine is thus obtained in the form of an 
orange powder (227 mg, 0.3 mmol, 40%). It is characterized 
10 by proton NMR, carbon 13 NMR and mass spectrometry. 

Production of the fluorescein-alkoxvamine label 4 : 
The fluorescein protected with fmoc (100 mg, 
0.13 mmol) is solubilized in 2 ml of anhydrous DMF. 

15 Pyridine (2 0 (jlI , 0.2 mmol) is then added. After 15 
minutes, the mixture is evaporated to dryness and then C18 
reverse phase purification is carried out (eluent : 
H 2 0/CH 3 CN: l/l then CH 3 CN) . After evaporation, the product 
is obtained in the form of an orange powder (49 mg, 

20 0.09 mmol, 70%). This fluorophore was characterized by 
proton NMR, 13C NMR and mass spectrometry. 



25 EXAMPLE III : REACTIVITY OF THE METHYL KETONE 

COMPOUNDS WITH THE FLUOROPHORE 



30 



The reactivity was tested at the nucleoside level 
and at the nucleotide level: 
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R = -OH, triphosphate 



The reaction is carried out in the presence of 
1.1 eq. of f luorophore-ONH 2 (4) with respect to the methyl 
5 ketone compound. The reaction is rapid and selective. At 
the nucleoside level, the adducts were characterized by- 
proton NMR and mass spectrometry. 

EXAMPLE IV : INCORPORATION OF THE URIDINE -METHYL 
10 KETONES AND POST -TRANSCRIPTION LABELING 

IV. 1. Description of the main steps 
Transcriptions 

The transcriptions were carried out on a PCR 

15 target (fragment of the 16 S RNA of Mycobacterium 
tuberculosis (Mtb) ) (Troesch A. et al , J. Clin. 
Microbiol., 37(1), 49-55, 1999) or a fragment of HIV 
reverse transcriptase (Kozal M.J. et al , Nature Medecine, 
2(7), 753-759, 1996) using T7 RNA polymerase and various 

20 ratios between the f unctionalized nucleotide and the 
natural nucleotides, while at the same time keeping the 
total concentration of each nucleotide at 1 mM. This ratio 
between the f unctionalized nucleotide and the 
corresponding natural nucleotide is expressed as a 

25 percentage and the ratios used are in general 0, 30, 70 
and 100%. The point 0% is used as a transcription control 
since, in this case, there is no f unctionalized nucleotide 
and the transcription reaction comprises the 4 natural 
nucleotides. When the ratio is 100%, this means that the 

30 f unctionalized nucleotide represents 100% of the 
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nucleotide studied (the other three nucleotides required 
for the transcription reaction naturally being the natural 
nucleotides) . This ratio of 100% for a f unctionalized 
nucleotide is the most significant test of the neutrality 
5 with respect to an enzymatic reaction since the enzyme 
must incorporate this nucleotide in order to function 
correctly. The incubation time for the transcription 
reaction is 1 h at 42°C. 

The transcriptions are analyzed by polyacrylamide 

10 gel electrophoresis under denaturing conditions (6% 
acrylamide, 7 M urea, 1XTBE) . The volume loaded is 5 jil 
and the migration takes place for 45 min at 150 V. The 
transcripts, natural or functional i zed with the methyl 
ketone function, are visualized under a UV lamp after 

15 staining with ethidium bromide. 



Assay 

The amount of 
reaction is determined by 
20 an aliquot derived from a 



transcripts produced in each 
UV assay, after purification, of 
transcription reaction . 



Enzymatic digestion 

The transcripts are purified on microcon-50 
filters (Amicon, Beverly, MA) in order to remove the 

25 excess of unincorporated nucleotides. They are then 
hydrolyzed according to the protocol described in patent 
application WO 98/05766 using PI nuclease (Boehringer 
reference 2362251, 2U, 2 h at 37°C) and alkaline 
phosphatase (Boehringer-Mannheim reference 713023 , 1U, 1 h 

30 at 37°C) . The digestions are carried out on 4 x 10 14 
transcript copies. The nucleoside composition is 
determined by reverse, phase HPLC, by comparing with 
nucleoside standards composed of a mixture of natural 
nucleosides . 
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The HPLC conditions are as follows: 

- analytical C18 column (250 x 4.6 mm) heated to 

45°C, 

- eluents: A: 50 mM sodium phosphate buffer, pH 7; 
5 B: MeOH/H 2 0: 95/5, v/v 

- gradient: 0% of B for 10 min, up to 30% of B in 
10 min, up to 80% of B in 10 min, 5 min at 80% of B, up to 
100% of B in 2 min. 

10 Labeling 

The transcripts are labeled using various 
proportions of fluorophore. The reaction time is 3 0 min at 
room temperature. The labeling is carried out on the 
transcripts generated from an Mtb target and/or HIV 
15 target, obtained by incorporation of 100% of a methyl 
ketone nucleotide. Initially, the labeled transcripts are 
analyzed by polyacrylamide gel electrophoresis and 
visualized under UV, before and after staining with 
ethidium bromide. 

20 

Cleavage 

Before hybridizing on the DNA chip, the labeled 
transcripts are cleaved at 65 °C for 30 min using imidazole 
25 and manganese chloride (MnCl 2 ) each at a concentration of 
30 mM. 

Hybridization on a DNA chip 

After cleavage, the fragments obtained are 
30 hybridized, detected and analyzed on a DNA chip 
(Affymetrix, Santa Clara, CA, USA) according to the 
protocol supplied by the manufacturer. 

The "myco" chips are designed for resequencing the 
213-415 region of the "Genbank" M20940 sequence of the 16S 
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RNA of Mycobacterium tuberculosis (Troesch A. et al, J . 
Clin. Microbiol., 37(1), 49-55, 1999). Those termed "HIV 
PRT 440" are designed for resequencing the RT (reverse 
transcriptase) and protease regions of the HIV virus 
5 (Kozal M.J. et al, Nature Medecine, 2(7), 753-759, 1996). 

In the case of the Mtb transcript, 10 14 copies are 
hybridized on the chip. 

In the case of the HIV transcript, 5 x 10 12 copies 
are hybridized on the chip. 

10 

IV. 2. Results 

Incorporation of the uridine ( C5 ) -C9 -methyl ketone 
nucleotide 1 (U-COCH 3 ) 

15 



Target used 


Ratio between 
f unctionalized 
nucleotide 
U-COCH3 and natural 
nucleotide 


Transcripts produced 
(copies /microliters) 


Mtb 


0% 


5 . 0 E +13 




30% 


5 . 0 E +13 




70% 


3 . 0 E +14 




100% 


0 . 8 E +14 


HIV 


0% 


0 . 9 E +12 




70% 


0 . 9 E +12 




100% 


0, 9 E +12 



The results in the table above show that the 
uridine-methyl ketone nucleotide {!) is very well 
incorporated by T7 RNA polymerase. This allows the 
20 production of activated transcripts capable of reacting 
with a label carrying the alkoxyamine function. 

Evaluation of the incorporation by HPLC 




The peak which corresponds to the U-methyl ketone 
nucleoside (retention time 26.2 minutes) is indeed 
observed in the chromatographic profile of the hydrolyzate 
of the transcripts containing 100% of U-COCH 3 (1) and 
5 obtained from the Mtb target, which means that the 
nucleotide is not modified during the transcription and 
enzymatic digestion steps. The absence of the peak 
corresponding to the natural uridine (retention time 14.97 
minutes) is also noted. 
10 This shows that the U-methyl ketone nucleotide is 

correctly incorporated and can be used at 100% in a 
transcription reaction . 

Evaluation of the labeling by polyacrylamide gel 

15 The labeling was carried out on the crude 

transcripts generated from an Mtb target, using 100% of 
the U-COCH3 nucleotide (1) . The amount of f luorophore 
compared to the number of reactive sites was varied. These 
ratios are: 2, 5, 10, 15, 20 and 50 equivalents of 

20 f lurophore-ONH 2 . 10 equivalents are suitable for intensive 
and selective labeling. 

On a gel, before staining with ETB, representing 
labeling using 5, 10 and 15 eq. of the f luorophore, 
labeling is observed which is more or less intense 

25 depending on the number of equivalents of fluorophore 
used, but there is a visible band whatever the amount of 
labels used. The appearance of new bands is also noted at 
high doses of fluorophore. This is certainly due to a high 
density of the label on the target. 

30 

Hybridization and analysis of the results 

The Mtb and HIV transcripts generated from 
corresponding targets, using 100% of UTP-COCH 3 (JL) , 
underwent the treatments described above. During the 
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cleavage step, a "blocking" agent (acetone or 
glutaraldehyde) was used in order to avoid saturating the 
chip with the excess of f luorophore-ONH 2 which adsorbs to 
the surface of this chip. The hybridization is carried out 
5 using the "Gene Chip fluidics Station" for hybridization 
(800101 Affymetrix, Santa Clara, CA) , the suitable chips 
and buffers and the protocol supplied by the manufacturer. 

The base call percentage, mean signal intensity, 
10 median intensity and background noise parameters were 
calculated using the software provided by the manufacturer 
(GenChip sequence analysis system, reference 900135, 
Affymetrix, Santa Clara, CA) . The results are given in the 
two tables below. The signal intensity is expressed in RFU 
15 (fluorescence units of the manufacturer) . 



MtJb transcripts: 



Target 


Base 
call 
(%) 


Mean 
intensity 
(Rfu) 


Median 
intensity 
(Rfu) 


o Mycobacterium tuberculosis 

100% U-COCH 3 transcripts 








100 eq Fluo-ONH 2 + 10 0 eq 
acetone 


97 . 7 


3540 


3260 


2 0 eq Fluo-ONH 2 + 20 eq 
glutaraldehyde 


92 .4 


12630 


11970 



Base call: percentage of bases correctly 



20 identified. 




HIV transcripts 



Target 


Base 
call 
(%) 


Mean 
intensity 
(Rfu) 


Median 
intensity 
(Rfu) 


• HIV 

70% U-COCH 3 transcripts 

10 eq Fluo-ONH 2 + 10 eq' 

glutaraldehyde 

100% U-COCH 3 transcripts 

10 eq Fluo-ONH 2 + 10 eq 

glutaraldehyde 


• 98.8 
99 . 0 


2175 
2675 


1765 
2090 


100% U-COCH 3 transcripts 

5 eq Fluo-ONH 2 without blocking 


97 . 5 


4750 


4095 


100% U-COCH 3 transcripts 
5 eq Fluo-0NH 2 + 5 eq 
glutaraldehyde 


98 . 5 


1825 


1550 


100% U~COCH 3 transcripts 

2 eq Fluo-0NH 2 without blocking 


98.3 


2380 


2035 


100% U-COCH 3 transcripts 
2 eq Fluo-0NH 2 + 2 eq 
glutaraldehyde 


98 . 8 


620 


540 



In both the Mtb and HIV cases, very effective 
5 labeling with intensities greater than 1000 Rfu and 
resequencing (base call) percentages close to 100% are 
observed. These results show that the reaction between the 
alkoxyamine and methyl ketone, demonstrated at the monomer 
stage, is therefore specific and effective between RNA 
10 sequences containing methyl ketone functions and the 
fluorescent label carrying the alkoxyamine function. 
Specifically, a sufficient amount of labeled and detect- 
able transcript fragments are obtained with only 2 equi- 
valents of alkoxyamine label per methyl ketone function. 




EXAMPLE V : INCORPORATION OF THE CYTIDINE -METHYL 
KETONE 3 AND POST -TRANSCRIPTION LABELING 

The cytidine 3. was incorporated at 100% into RNAs 
5 transcribed from Mtb, as described in Example III. The 
cleavage and labeling method is also carried out as 
indicated above. 



Target 


Base 
call 
(%) 


Mean 
intensity 
(Rfu) 


Median 
Intensity 
(Rfu) 


100% C-COCH 3 transcripts 
2 0 eq Fluo-ONH 2 + 20 eq 
glutaraldehyde 


94 . 2 


6655 


6200 



10 Here again, very effective labeling is observed. 

The signal intensity is greater than 6000 Rfu and the 
resequencing is 94%. As in the previous example, this 
result demonstrates that the reaction between the alkoxy- 
amine and methyl ketone is specific and effective between 

15 RNA sequences containing methyl ketone functions and the 
fluorescent label carrying the alkoxyamine function. 

EXAMPLE VI : ADVANTAGES OF THE METHYL KETONE 
FUNCTION COMPARED TO ALKOXYAMINE, ALDEHYDE AND AMINE 
20 FUNCTIONS . 

1. Comparison between the alkoxyamine and methyl 
ketone function on the U nucleotide. 

Uridine carrying an alkoxyamine chain at position 
25 5 is prepared according to the method described in Example 
4 of patent application WO-A-98/05766. The synthesis of 
the fluorescein aldehyde (FLUO-CHO) label is described in 
Example 18 of application WO-A-98/05766. 




It is important to note that the alkoxyamine 
function requires permanent protection with the tert- 
butoxycarbonyl (BOC) group. This function is deprotected 
at the same time as the isopropylidene in 2' ,3' after 
5 phosphorylation in the presence of 50% trif luoroacetic 
acid. This percentage of TFA is necessary to obtain a 
deprotection of the alkoxyamine close to 90%. In order to 
obtain total deprotection, more concentrated acid 
solutions are necessary. In these solutions, the 

10 triphosphate degrades very rapidly to diphosphate and 
monophosphate . 

In the case of the methyl ketone function, no 
deprotection is necessary and only 25% of TFA is used for 
total deprotection of the isopropylidene in 2* ,3' after 

15 phosphorylation. No degradation is observed with this 
percentage of TFA. 

The methyl ketone chain is sufficiently 
hydrophobic even after deprotection; it thus contributes 
to good separation of the nucleotide triphosphate when it 

20 is purified by reverse phase HPLC . This separation is more 
difficult in the case of the alkoxyamine chain. 

Incorporation and post-labeling of the UTP- 
alkoxyamine 

25 The reactions for transcription, enzymatic 

hydrolysis of the transcripts, HPLC analysis and post- 
labeling are carried out under the same conditions as 
those described in Example IV. 

30 The figure shows the transcription yields measured 

in terms of the amount of amplicons produced (on the 
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y-axis in copies/microliters) as a function of the 3 
nucleotides used. 

In this example, the incorporation of the two 
5 nucleotides, alkoxyamine (U-ONH 2 ) and methyl ketone 
(U-COCH3 compound 1) , is studied. 

The uridine -methyl ketone 1 at 100% has no effect 
on the transcription yield. This yield is lower when the 
same nucleotide carrying the alkoxyamine function is 
10 incorporated. 

Post -labeling : 



MTB target 


Base 


Mean 


Median 




call 


intensity 


intensity 




(%) 


(Rfu) 


(Rfu) 


100% U-ONH 2 TRANSCRIPTS + 








10 EQ 


91 . 6 


5735 


5335 


FLUO-CHO 









The resequencing percentage using the uridine - 
15 alkoxyamine is lower than that obtained with its homolog 
carrying the methyl ketone function. 

2 • Comparison between the aldehyde and methyl 
ketone function on the U nucleotide. 

20 The uridine with an aldehyde chain (U-CHO) is 

prepared according to the following scheme: 




OH OH 

The chain is synthesized by peptide coupling 
5 between 4 -aminobutyraldehyde diethylacetal and 4-pentynoic 
acid. Heck coupling produces the protected nucleoside. 
Phosphorylation followed by deprotection in acid medium 
correctly produces the expected nucleotide. The products 
were characterized by X H NMR and 13 C NMR. 

10 Here again, it is necessary for the aldehyde 

function to be protected during the coupling and 
phosphorylation steps. The deprotection of the aldehyde is 
carried out at the same time as that of the isopropylidene 
(25% of TFA) ; however, the purification and desalination 

15 of the deprotected product are more difficult compared to 
those of the nucleotide-methyl ketone (U-COCH 3 , compound 
1) . The yield from the phosphorylation step is 20%. 

The HPLC analysis of the transcripts containing 
the UTP-aldehyde , hydrolyzed using the enzymatic technique 
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described in Example IV, shows that the nucleotide- 
aldehyde is transformed. This transformation is certainly- 
due to the aldehyde function interacting with components 
of the buffers during the transcription or enzymatic 
5 hydrolysis of the transcripts. 

The post-labeling reaction using the fluorescein- 
alkoxyamine shows no result after analysis of the 
mycobacterium transcripts on gel and on DNA chips. This 
shows that transformation of the aldehyde has occurred 
10 during the transcription step, which shows the advantage 
of the methyl ketone function compared to the aldehyde. 

3 . Comparison between the amine and methyl ketone 
function on the IT nucleotide. 

15 The uridine carrying an amine chain at position 5 

(U-NH 2 ) is prepared according to the method described in 
Example 3 of patent application WO 98/05766. It is 
necessary to protect the amine function with the Boc group 
in this case as well . 

20 The transcription yields obtained with 100% of 

U-NH 2 are much lower than when the methyl ketone function 
is used and do not allow efficient resequencing on the DNA 
chip both in the case of Mtb and in the case of HIV. 

25 4. Comparison between the amine and methyl ketone 

function on the C nucleotide. 

The cytidine carrying an amine chain at position 4 
(C-NH 2 ) is prepared according to Example 16 of patent 
application WO-A- 98/05766 . 
30 This nucleotide was incorporated into fragments of 

the 16S RNA of Mycobacterium tuberculosis through 




10 



transcription reactions carried out on a PCR target 
according to the protocol described in Example IV. 

The HPLC analysis of the transcripts containing 
the C-NH 2/ hydrolyzed using the enzymatic technique 
described in the example above, shows that the aminated 
nucleotide is correctly incorporated and is intact. 

The yields from transcription with various 
percentages of C-NH 2 , also determined as indicated in 
Example IV, are given in the table below. 



Ratio between f unctionalized 
nucleotide C-NH 2 
and natural nucleotide 
(%) 


Transcripts produced 
(copies / microliters) 


0 


3 . 48 E +13 


70 


1 . 94 E +13 


100 


1 . 51 E +12 



It is important to note that, with 100% of C-NH 2 , 
the amount of amplicons obtained is 20 times less than 
that obtained with transcription containing 100% of 
15 natural nucleotide. 

The labeling reagent used in the post-labeling 
reaction was fluorescein isothiocyanate obtained from 
Sigma-Aldrich (St Quentin Falavier, France) . 



20 




Fluorescein isothiocyanate (FITC) 



44 



9 6 , 0! 919 0 I 



The labeling of the transcripts obtained by 
incorporation of the C-NH 2 were labeled with FITC, 
hybridized on the DNA chip intended for the identification 
of these Mtb targets, and the signals were analyzed 
5 according to the protocol described above. The results 
obtained with and without blocking agent are given in the 
table above . 



Target 


Base call 
(%) 


Mean intensity 
(Rfu) 


Nonpurified transcripts + 
100 eq FITC 


ND 


ND 


Nonpurified transcripts + 
100 eq FITC +1000 eg of 
blocking diamine 


86 


346 



10 ND : not determined, the surface of the chip is 

completely saturated . 

B 1 o c k i ng di ami ne : H 2 N - CH 2 - CH 2 - O - CH 2 - CH 2 - O - CH 2 - CH 2 - 

NH 2 . 

15 If the diamine chain is not added after the 

labeling, the surface of the chip is completely saturated 
with the 10 0 eq of label and the signals cannot be 
exploited. This saturation is certainly due to the 
interaction of the isothiocyanate function with the 

20 exocyclic amine functions of the bases. 

The post-labeling addition of 1000 eq of diamine, 
having the same structure as that carried by the cytidine, 
appears to confirm this hypothesis since the surface of 
the chip is no longer saturated, and 86% resequencing is 

25 obtained. The signal intensity is low compared to that 
obtained with transcripts containing the methyl ketone and 
labeled with f luorescein-alkoxyamine . 




In addition to the inhibitory effect of the C-NH 2 
during the transcription, the resequencing percentage when 
the C-NH 2 is used is lower than that obtained with its 
homolog carrying the . methyl ketone function (C-COCH 3/ 
5 compound 3.) . 



